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£NJ . We prove a central limit theorem for linear triangular arrays under weak dependence 

conditions. Our result is then applied to the study of dependent random variables sampled 
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1 Introduction 



Let {£i}«ez be a sequence of centered, non essentially constant and square integrable real 
valued random variables. Let {a n> j , — k n < i < k n } be a triangular array of real numbers 
such that for all n G N, J2i=-k n a n i > °- We 

are interested in the behaviour of linear 

triangular arrays of the form 

X n ,i — 0"n,i Ci ) ^ — 0,1,... , i — k n , • • • , k n , (1 .1) 

where (k n ) n >i is a nondecreasing sequence of positive integers satisfying k n ► +oo. We 

— n— ++oo 

work under a weak dependence condition introduced in Dedecker et al. (2007). We first prove 
a central limit theorem for linear triangular arrays of type (II. ID (Theorem 13.11 of Section [3]). 
Applying this result, we then prove a central limit theorem for the partial sums of weakly 
dependent sequences sampled by a transient Z- valued random walk (Theorem l4.1l of SectionH]). 
This result extends the results obtained by Guillotin-Plantard & Schneider (2003). Peligrad 
& Utev (1997) derive a central limit theorem for triangular arrays of type (jl.ip under mixing 
conditions. Unfortunately, mixing is a rather restrictive condition, and many simple Markov 
chains are not mixing. For instance, Andrews (1984) proved that if (£j)i>i is independent 
and identically distributed with marginal 23(1/2), then the stationary solution (£j)i>o of the 
equation 

£n = 2^"-i + e «) ' & independent of (e»)»>i (1.2) 

is not a-mixing in the sense of Rosenblatt (1956). We have indeed a(cr(^o), cr(£ n )) = 1/4 for 
any n. For any y £ R, let [y] denote the integer part of y. The chain satisfying (jl.2p is the 
Markov chain associated to the dynamical system generated by the map T(x) = 2x — [2x] 
on the space [0,1], equipped with the Lebesgue measure, and it is well known that such 
dynamical systems are not a-mixing in the sense that a(a(T), a(T n )) does not tend to zero as 
n tends to infinity. Withers (1981) proves triangular central limit theorems under a so-called 
/-mixing condition, which generalizes the classical notions of mixing (such as strong mixing, 
absolute regularity, uniform mixing introduced respectively by Rosenblatt (1956), Rozanov 
& Volkonskii (1959) and Ibragimov (1962)). The idea of /-mixing requires the asymptotic 
decoupling of the 'past' and the 'future'. The dependence setting used in the present paper 
(introduced in Dedecker et al, 2007) follows the same idea. In Section [5] we give lots of 
pertinent examples satisfying our dependence conditions. Coulon-Prieur & Doukhan (2000) 
proves a triangular central limit theorem under a weaker dependence condition. However, they 
assume that the random variables £j are uniformly bounded. Their proof is a variation on 
Lindeberg-Rio's method developed by Rio (1996,1997). Also using a variation on Lindeberg- 
Rio's method, Bardet et al. prove a triangular central limit theorem, requiring moments of 
order 2 + 5, 5 > 0. In Section [21 we introduce the dependence setting under which we work 
in the sequel. Models for which we can compute bounds for our dependence coefficients are 
presented in Section [5j At least, we give an application to parametric estimation by random 
sampling in Section [6l 

2 Definitions 

In this section, we recall the definition of the dependence coefficients which we will use in the 
sequel. They have first been introduced in Dedecker et al. (2007). 
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On the Euclidean space R m , we define the metric 

m 

di(x,y) = ^2\ Xi - yi\. 

i=l 

Let A = (JmgN* ^-rn where A m is the set of Lipschitz functions / : R m — ► R with respect to 
the metric di. If / € A m , we denote by Lip(/) := sup^y the Lipschitz modulus of 

/. The set of functions / 6 A such that Lip(/) < 1 is denoted by A. 



Definition 2.1 Let £ be a K m -valued random variable defined on a probability space (O, .A, P) ; 
assumed to be square integrable. For any a -algebra Ai of A, we define the 62-dependence 
coefficient 

2 (AU)=sup{||E(/(O|Al)-E(/(6)ll2, /€A}. (2.3) 

We now define the coefficient 6k,2 for a sequence of u-algebras and a sequence of R- valued 
random variables. 

Definition 2.2 Let (£i)iez be a sequence of square integrable random variables valued in R. 
Let (Aii)i£z be a sequence of a -algebras of A. For any k £ N* U {00} and n £ N, we define 

0k,2( n ) = max jSuj){9 2 (M p , . . ,€j l )),p + n <ji< ... < ji} 

l<l<k I 

and 

2 (n) = 0oo,2(n) = sup 9 k>2 {n) . 

k&i* 

Definition 2.3 Let (Ci)iez fre o sequence of square integrable random variables valued in R. 
Let (7Wj)j 6 z be a sequence of a -algebras of A. The sequence (£i)iez said to be 62-weakly 
dependent with respect to (A4i)i^z if @2{ n ) ► 0. 

n—>+oo 

Remark: 

Replacing the || • H2 norm in (|2.3p by the || • ||i norm, we get the 0\ dependence coefficient 
first introduced by Doukhan & Louhichi (1999). This weaker coefficient is the one used in 
Coulon-Prieur & Doukhan (2000). 



3 Central limit theorem for triangular arrays of dependent 
random variables 

Let {X n j, n € N, — k n < i < k n } be a triangular array of type (jl.ip . We are interested in 
the asymptotic behaviour of the following sum 

k n k n 
i = hn i = kn 

Let (M.i)i£% be the sequence of u-algebras of A defined by 

Mi = a (Cj , j < i) , % G Z . 

In the sequel, the dependence coefficients are defined with respect to the sequence of a-algebras 
(A^i)igz- We denote by a\ the variance of S n . 
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Theorem 3.1 

Assume that the following conditions are satisfied : 
{A x ) (i) liminf„^ +00 ° l 2 > 0, 



(ii) lim n _> +00 a n 1 max_ fen <j< fcn |a n>i | = 0. 
(A2) i- s an uniformly integrable family. 



(A3) * s bounded above by a non-negative function g(-) such that 



x 1 — > x 3//2 g(x) is non-increasing, 



DC 



3 < e < 1 , ^2T 5 (2 i£ ) < 00. 



i=0 



Then, as n tends to infinity, converges in distribution to Af(0, 1). 



Remark: 



• Theorem 2.2 (c) in Peligrad & Utev (1997) yields a central limit theorem for strongly 
mixing linear triangular arrays of type (II. lj) . They assume that {|£i| 2+<5 } is uniformly 
integrable for a certain 5 > 0. Such an assumption is also required for Theorem 2.1 in 
Withers (1981) for /-mixing arrays. In Coulon-Prieur & Doukhan (2000), the random 
variables £j are assumed to be uniformly bounded. 

• The proof of Theorem 2.2 (c) in Peligrad & Utev (1997) relies on a variation on Theorem 
4.1 in Utev (1990) (see Theorem B in Peligrad & Utev, 1997). The proof of Theorem 
13. H which is postponed to the Appendix, also makes use of a variation on Theorem 4.1 
in Utev (1990) (see also Utev, 1991). 

• If O^in) = O (n~ a ) for some positive a, condition (.A3) holds for a > 3/2. 

4 Central limit theorem for the sum of dependent random 
variables sampled by a transient random walk 

4.1 The main result 

Let (E,£,n) be a probability space, and T : E ^> E a bijective bimeasurable transformation 
preserving the probability \x. We define the stationary sequence (£i)ieZ = (T l )i£z from (E,/j,) 
to E. Let (Xi)i>x be a sequence of independent and identically distributed random variables 
defined on a probability space (O, A, P) with values in Z and 



n 



S n = y^Xj, n > 1, So = . 



i=l 



For / G L 



(/x) and u G S7, we are interested in the sampled ergodic sum 



n-l 
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By applying Birkhoff 's ergodic Theorem to the skew-product: 

U : ttx E ft x E 

(uj,x) i — ^ (ctuj, T Ul x) 

where a is the shift on the path space ft = Z N , we obtain that for every function / G L 1 ( / u), 
the sampled ergodic sum converges P tg) /i-almost surely. A natural question is to know if the 
random walk is universally representative for L p , p > 1 in the following sense: there exists 
a subset fto of ft of probability one such that for every u G fto, f° r every dynamical system 
(E,£, (j,,T), for every / G L p , p > 1, the sampled ergodic average converges //-almost surely. 
The answer can be found in Lacey et al. (1994) if the AVs are square integrable: The random 
walk is universally representative for L p , p > 1 if and only if the expectation of X\ is not 
equal to which corresponds to the case where the random walk is transient. In that case, 
it seems natural to study the fluctuations of the sampled ergodic averages around the limit. 
From Lacey's theorem (1991), for any H G (0,1), there exists some function / G L 2 (P(g> /x) 
such that the finite-dimensional distributions of the process 

H-i 

k=0 

converge to the finite dimensional distributions of a self-similar process. Unfortunately, this 
convergence on the product space does not imply the convergence in distribution for a given 
path of the random walk. A first answer to this question is given in Guillotin-Plantard & 
Schneider (2003) where the technique of martingale differences is used. Let us recall that 
this method consists (under convenient conditions) of decomposing the function / as the sum 
of a function g generating a sequence of martingale differences and a cocycle h — h o T. In 
the standard case, the central limit theorem for the ergodic sum is deduced from central 
limit theorems for the sums of martingale differences, the term corresponding to the cocycle 
being negligeable in probability. In Guillotin-Plantard & Schneider (2003), only functions 
/ generating a sequence of martingale differences are considered. In this section, where we 
prove a central limit theorem for 02-weakly dependent random variables sampled by a transient 
random walk, this reasoning does not hold anymore. We apply Theorem 13.11 of Section [3l 

In the sequel, the random walk (S n ) n >o is assumed to be transient. In particular, for every 
x G Z, the Green function 

+oo 

G(0,x) = J2nSk = x) 

k=0 

is finite. For example, it is the case if the random variable X\ is assumed with finite absolute 
mean and nonzero mean. It is also possible to choose the random variables (Xi)i>\ centered 
and for every x G R, 

nn- 1/a S n <x) >F a (x), 

n— >+oo 

where F a is the distribution function of a stable law with index a G (0, 1). Stone (1966) has 
proved a local limit theorem for this kind of random walks from which the transience can be 
deduced. The expectation with respect to the measure \i (resp. with respect to P, P® //) will 
be denoted in the sequel by E M (resp. by Ep, E). 
For every function / G L 2 (/x) such that E M (/) = 0, we define 

a 2 (f) = 2 £ G(0, x)E,(ff o T x ) - E,(f) . 
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Let us now state our main result whose proof is deferred to Subsection 14.31 



Theorem 4.1 

Let f be a function inL 2 (/j) such thatK^f) = 0. Assume that {f °T X ) X& % satisfies assumption 
(A3) of Theorem \3. 1[ Assume that o~ 2 (f) is finite and positive. 
Then, for F-almost every u£0, 



1 n 

J2f°T S *M — — ^Af(0,a 2 (f)) 

\/n ^-^ n-*+oo 
V 1 n 



in distribution. 

n— *+oo 

fc=0 

Remark: 

1. In the particular case where (foT x ) x ^i is a sequence of martingale differences, we recognize 
Theorem 3.2 of Guillotin-Plantard & Schneider (2003). Indeed, assumptions are satisfied using 
orthogonality of the / o T x ''s and then, a 2 (f) = (2G(0, 0) - 1)E M (/ 2 ). 

2. The stationarity assumption on the sequence can be relaxed by a stationarity 
assumption of order 2, that is: 

Vz E Z, Var^j = Var£i and 
Vi < j , Cov(&,£j) = Cov(^i,Ci+i-i) • 

4.2 Computation of the variance 

The random walk (5 n ) n >o is defined as in the previous section. The local time of the random 
walk is then defined for every x G Z by 

n 
i=0 

The self-intersection local time is defined for every x G Z by 

71 



ij=0 



and can be rewritten using the definition of the local time as 

a(n, x) = y~" j N n (y + x) N n {y) 



Let / be a function in L 2 (/x) such that E (Lt (/) = 0. For every well, 

n 

fc=o xez 

In order to apply results of Theorem l3.ll we need to study, for any fixed u G Q, the asymptotic 
behaviour of the variance of this sum, namely 



0- 2 n (f)=^ 



2 N 



fc=0 
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The variable u will be omitted in the next calculations. We have 

2 

2 



<{f) = E 



n J]iVn(x)/or a 

= Yl N n (x)N n (y)E^foT x -yf) 

= E N n (y + z)N n (y)E ll (foT z f) 

y,zeZ 

= ^«(v)E,(/of/). 

We are now able to prove the following proposition: 
Proposition 4.1 If G(0, x)E fl (f f o T x ) < +oo, then 

a n(f) F-q.s. ( ^2tf\ 
n n^+oo 

Proof of Proposition I4.lt 

Let us assume first that the function / is positive. For every < m < n, we denote by W m>n 
the random variable 

n 

- E £i{*-s,^>iw/°n- 

i,j=m xGZ 

Then, since / is positive, for every A;, m, n such that < k < m < n, 

that is {W m .n)m,n>S) is a subadditive sequence. Then, 

i,j=0 xEZ 
n 

= EE F ( Si ~ S i = x ) E m(/ / ° TX )' b y Fubini Theorem 

i ,j=0 rcgZ 

n i— 1 

(n + 1)E M (/ 2 ) + 2 ^ £ £ P(5' i _ j = x)E M (/ / o I*)) 
i=i j=o zez 

n i 



((n + 1)E„(/ 2 ) + 2^^X^ W = *(/ / n) • 



Now, using that 



i iToo E E p ^ = / ° TX ) = E / ° TZ ) - e m(A 

we conclude that 

mVo^ = 2 _ 2 ^ / o T x ) < oo . 



So the sequence (W m)n ) TO)n >Q satisfies all the conditions of Theorem 5 in Kingman (1968). 
Hence 

W^n (/2) _ 2 G(0 , x)E A4 (/ /of), 

n n— * +00 ' — » 

By remarking that Wo,n = ~ a n(f)i Proposition 14.11 follows for positive functions /. If the 
function / is not positive, we can decompose it as 

/ = f 1 {f>o} ~ (-Z) 1 !/^} 

and, for all x G Z, / o T x as 

/ o T x = f o T x l{f oT x> y - (-/ o T x )l{f oT x <0 y . 
Then, a simple calculation yields 

Mf f° TX ) = Mf f TX 1{/>0;/oT^>0}) + M(-f)(~f ° TX ) 1 {/<0;/oT-<0}) 

- E M (/ (-/ o T°) 1{/>0;/o^<0}) " M(~f) (/ T ' X ) 1 {/<0;/oT^>0}) • 

By applying the previous reasoning at each of the four positive terms of the right-hand side, 
Proposition 14.11 follows. A 

Remark: 

Let us consider the unsymmetric random walk on nearest neighbours with p > q. Then, for 

x > 0, 

G(0,x) = (p-q)- 1 , 

and for x < —1, 

-\(V 



G(o,x) = ( P - q y 

A simple calculation gives 

a 2 (h-hoT) = 2^ [2G(0,x) - G(0,x + 1) - G(0,x- l)]E /t (/i hoT x )-2E^(h 2 )+2K^h hoT) 

x&Z 

= -2^-EJh 2 ) + WJh hoT) — 2 ^ - ^ V ( EJh hoT x ). 

4.3 Proof of Theorem 14.11 

Let us define M n = max \Si-\- First note that 

0<fc<n 

n 

^2foT s *= ]T N n (x)foT x . 



k=0 \x\<M n 
We want to apply Theorem 13. II to the triangular array 



X n>i = oT\ne N, -M n <i<M n \ . (4.4) 

n 
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As / belongs to L 2 (^), the family {(/ o T*) 2 | p is uniformly integrable as it is stationary. 

It remains to prove that assumption (Ai) of Theorem 13.11 is satisfied for the triangular array 
defined by (|Q|> . 

Proof of {Ai)(i): 

First, by Proposition 3.1. in Guillotin-Plantard & Schneider (2003), Ylff^-M a ni = a ^\f^ 
converges P-almost surely to 2G(0,0) — 1 as n goes to infinity. Then, by Proposition 14. 1\ we 
know that <r 2 (/)/n converges to cr 2 (f), which is assumed to be positive. Hence (Ai)(i) is 
satisfied. A 

Proof of (Ai)(ii): 

Now, by Proposition 3.2. in Guillotin-Plantard & Schneider (2003), we know that for every 
P>0, | 

max \a n A = —= max N n (i) = o I n p ~2 ) p — almost surely. 

-M„<i<M„ ' s pn iez v / 



So J -jI\jj max -M„<i<M„ l a n,i| tends to zero P— almost surely and assumption (Ai)(ii) is 

satisfied. A 

Hence Theorem 13.11 applied to X^-M a n,i f ° T l , Proposition 14.11 and Slutsky Lemma 
yield the result. A 



5 Examples 

In this section, we present examples for which we can compute upper bounds for #2( n ) for any 
n > 1. We refer to Chapter 3 in Dedecker & al (2007) and references therein for more details. 

5.1 Example 1: causal functions of stationary sequences 

Let (E, £, Q) be a probability space. Let (ej)jgz be a stationary sequence of random variables 
with values in a measurable space S. Assume that there exists a real valued function H defined 
on a subset of 5 N , such that H(sq, e_i>£_2, . . • , ) is defined almost surely. The stationary 
sequence (£ n )nez defined by £ n = H(e n , e n _i, £ n -2, ■ ■ •) is called a causal function of (ei)igz. 

Assume that there exists a stationary sequence (ei')i'ez distributed as (ej)igz arid in- 
dependent of (ei)i<o- Define £* = H(e n \ e„_i', £ n -2 , ■ ■ •)• Clearly, ^* is independent of 
-^o = i i < 0) and distributed as £ n . Let (^2(i))i>o be a non increasing sequence such 
that 

||E(|&-£| \M )\\ 2 <S 2 (i). (5.5) 
Then the coefficient 62 of the sequence (£n)n>o satisfies 

2 (») < • (5-6) 

Let us consider the particular case where the sequence of innovations (ej)jgz is absolutely 
regular in the sense of Volkonskii & Rozanov (1959). Then, according to Theorem 4.4.7 in 
Berbee (1979), if E is rich enough, there exists (e^iez distributed as (e«)igz and independent 
of (£i)i<o such that 

Q(e l + e'i for some i > k \ F ) = - \\Qs„\^ - Qe k \\ v , 
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where = (Ek,£k+i, ■ ■ ■), = , i < 0), and || • ||^ is the variation norm. In particular 
if the sequence (e»)igz is idependent and identically distributed, it suffices to take e\ = e, for 
i > and e\ — e" for i < 0, where (e" )i<=z is an independent copy of (ej)iez. 

Application to causal linear processes: 

In that case, £ n = Sj>o a j £ n-j, where (o 3 -) 3 >o is a sequence of real numbers. We can choose 

i-1 

j>i j=0 

From Proposition 2.3 in Merlevede & Peligrad (2002), we obtain that 

/ fl3(^(£k,k<0),a(e k ,k>i-j)) 1 / 2 

5z{i) < ||e - 41b 2J a il + X>j| [2 2 / Ql («)J 

where Q £0 is the generalized inverse of the tail function x i— > Q(|eo| > £)• 



5.2 Example 2: iterated random functions 

Let (£ n )n>o be a real valued stationary Markov chain, such that £ n = F(£ n _i,e n ) for some 
measurable function F and some independent and identically distributed sequence (ej)i>o in- 
dependent of £o- Let be a random variable distributed as £o and independent of (£o> (e«)i>o)- 
Define Q = F(£*_ 

-i) e n)- The sequence (£^)n>0 is distributed as (£ n )n>o and independent of 
£o- Let A4i = <?(£.,■, < j < i). As in Example 1, define the sequence {5 2 (i))i>o by (|5.5p . The 
coefficient #2 of the sequence (£n)n>o satisfies the bound (|5.6|) of Example 1. 

Let /i be the distribution of £0 and (£^)n>o be the chain starting from £q = x. With these 
notations, we can choose 82(1) such that 

5 2 {i) > Mi-Gb = (// iii& ? -aiM^M^)Y /2 • 

For instance, if there exists a sequence (d,2(i))i>o of positive numbers such that 

M*-$h<d 2 (i)\x-y\, 

then we can take 62(1) = d 2 (i)\\£o — QIU- For example, in the usual case where \\F(x,So) — 
F(y,£o)\\2 < K \ x — v\ for some k < 1, we can take c^W = 

An important example is £ ra = /(£ n -i) + £ n for some K-Lipschitz function /. If £0 has a 
moment of order 2, then #2(2) < ^11^0 — • 



5.3 Example 3: dynamical systems on [0, 1] 

Let I = [0, 1], T be a map from I to I and define X{ = T l . If ^ is invariant by T, the sequence 
(Xj)j>o of random variables from (I,fj) to / is strictly stationary. 

For any finite measure v on /, we use the notations v(h) = Jj h(x)v(dx). For any finite 
signed measure v on /, let ||z/|| = |z^|(I) be the total variation of v. Denote by ||(?||i,a the 
L 1 -norm with respect to the Lebesgue measure A on /. 
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Covariance inequalities. In many interesting cases, one can prove that, for any BV function 
h and any k in L (I, «), 

\Cov(h{X ),k{X n ))\ ^OnllfcC^nJIIidl/ill^ + lldhll), (5.7) 

for some nonincreasing sequence a n tending to zero as n tends to infinity 

Spectral gap. Define the operator C from L 1 (7, A) to L 1 (/, A) via the equality 



1 

C(h)(x)k(x)d\(x) 



o 



f h{x)(k o T)(x)d\{x) where h G L 1 (I, A) and A; G L°°(J, A). 
■/ o 



The operator C is called the Perron-Frobenius operator of T. In many interesting cases, the 
spectral analysis of £ in the Banach space of BV- functions equiped with the norm \\h\\ v = 
\\dh\\ + ||/i||i,a can be done by using the Theorem of Ionescu-Tulcea and Marinescu (see Lasota 
and Yorke (1974) and Hofbauer and Keller (1982)). Assume that 1 is a simple eigenvalue of 
C and that the rest of the spectrum is contained in a closed disk of radius strictly smaller 
than one. Then there exists a unique T-invariant absolutely continuous probability p whose 
density is BV, and 

C n (h) = \{h)f^ + m n (h) with W l {h)\\ v < Kp n \\h\\ v . (5.8) 

for some < p < 1 and K > 0. Assume moreover that: 

I* = {ffi 7^ 0} is an interval, and there exists 7 > such that / M > 7 _1 on (5.9) 

Without loss of generality assume that I* = I (otherwise, take the restriction to in what 
follows). Define now the Markov kernel associated to T by 

P(h)(x) = ^MM , (5.10) 

It is easy to check (see for instance Barbour et al. (2000)) that (Xo, X\, . . . , X n ) has the same 
distribution as (Y n ,Y n _i, . . . ,Yq) where (li)«>o is a stationary Markov chain with invariant 
distribution p and transition kernel P. Since ||/<?||oo ^ II/^IId ^ 2 1 1 1 1 ^ \\g\\ v , we infer that, 
taking C = 2K 7 (||^|| + 1), 

P n (h)= t i(h)+g n with || 5 n||oo<Cp n ||^||,. (5.11) 

This estimate implies (|5.7p with a n = Cp n (see Dedecker & Prieur, 2005). 

Expanding maps: Let ([0^, Oj+i[)i<j<Ar be a finite partition of [0, 1[. We make the same 
assumptions on T as in Collet et al (2002). 

1. For each 1 < j < N, the restriction Tj of T to ]aj,Oj+i[ is strictly monotonic and can 
be extented to a function Tj belonging to C 2 ([aj, aj+i]). 

2. Let I n be the set where (T n )' is defined. There exists A > and s > 1 such that 
mf xeIn \(T n Y(x)\>As n . 

3. The map T is topologically mixing: for any two nonempty open sets U, V, there exists 
n > 1 such that T- n (U) n V / for all n > n . 
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If T satisfies l.,2. and 3., then (|5.8p holds. Assume furthermore that (|5.9p holds (see Morita 
(1994) for sufficient conditions). Then, arguing as in Example 4 in Section 7 of Dedecker 
& Prieur (2005), we can prove that for the Markov chain (Y^)i>o and the cr-algebras Mi = 
c(Yj , j < i), there exists a positive constant C such that #2(2) < Cp l . 

Remark: 

In examples 2 and 3, the sequences are indexed by N and not by Z. However, using existence 
Theorem of Kolmogorov (see Theorem 0.2.7 in Dacunha-Castelle & Duflo, 1983), if (Xj)j e ^ 
is a stationary process indexed by N, there exists a stationary sequence (li)jgz indexed by Z 
such that for any k < I E Z, both marginals (Y/%, . . . , Yj) and (Xq, . . . , ^Q-fc) have the same 
distribution. Moreover, in examples 2 and 3, the sequences are Markovian, hence 0]f(n) = 
62 (n) for any n > 1. We then apply Theorem 14.11 to the sequence (li)jgz- The limit variance 
can be rewritten as 

a\f) = 2 G(0, x) Cov(f(X ), f(X [A )) - Var(/(X )) . 

6 Application to parametric estimation by random sampling 

We investigate in this section the problem of parametric estimation by random sampling for 
second order stationary processes. We assume that we observe a stationary process (&)igN at 
random times S n , n > 0, where (S n ) n >o is a non negative increasing random walk satisfying 
the assumptions of SectionlU In the case where the marginal expectation of the process (£i)ieN) 
m, is unknown, Deniau et al. estimate it using the sampled empirical mean rh n = — X^=i 6Sr 
They measure the quality of this estimator by considering the following quadratic criterion 
function: 

a(S) = lim (nVarm n ). 

n— >+oo 

In the case where (Cov(£i, £n+i)) n£ N ^ s m ' j we have 

+00 

a(S)= Cov (^Si,^S| fc | +1 ) < oo- 

k=— 00 

We then get Corollary 16.11 below, which gives the asymptotic behaviour of the estimate 
rh n after centering and normalization. 

Corollary 6.1 Let us keep the assumptions of Section^ on the random walk (S' n ) n gN an d on 
the process (£i)«eN- Assume moreover that So = and that (S n +i — S n ) ne ^ takes its values in 
N*. Then, for F '-almost every uj G ft, 

y/n {rh n — m) > A/"(0, a(S)) . 

n— >+oo 

Proof of Corollary I6.lt 

Corollary 16.11 can be deduced from Theorem 14.11 of Section [5] applied to / = Id — m. We have 
indeed a 2 (f) = o(5). A 

Let k G M+, k > 1. A K-optimal law for the step S n +i — S n is a distribution which 
minimizes a(S') under the constraint E(5' ra+ i — 5 n ) < k. We say that the sampling rate is less 



12 



or equal to 1/k. Deniau et al. (1988) give sufficient conditions for the existence of a K-optimal 
law of the step S n+ i — S n . In the case of an A.R.(l) model £ n = p£ n -i + \p\ < 1) with e a 
white noise, we know from Taga (1965) that there exists a unique K-optimal law Lq given by : 

• Lq = 8% if p < (the sampled process is the process itself), 

• Lq = [1 — (k — [k])] <5[ k j + (k — [k]) <5[ k ]+i if p > ([•] denotes the integer part). 



7 Appendix 

This section is devoted to the proof of Theorem 13,11 of Section [3l 
Proof of Theorem (BTTt 

In order to prove Theorem l3.lt we first use a classical truncation argument. For any M > 0, 
we define : 



<Pm ■ ■ 

and 



x i ► <pm(x) = (x A M) V (-M) 



1 X' I — > (/9 M (x) = X — fM{x) ■ 

We now prove the following Lindeberg condition : 

a~ 2 £ E((^-(X n>i )) 2 ) — -0. (7.12) 

i— 

We have, for n large enough, 

The last right hand term in the above inequalities is bounded by 

which tends to zero as n goes to infinity, by assumptions (A±) and (^2)- 

By (|7.13p we find a sequence of positive numbers (e n )n>i such that e n > 0, and 

n— >+oo 

( E fam^/m^ K ,|)) ^ E <* • (7 - 14) 



Let us now prove that (|7.14p yields 



To prove (|7.15p . we need the following Lemma : 



kn 

a~ 2 Vax I £ ^ nCT "(^n,i) ) — — ^0. (7.15) 
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Lemma 7.1 Assume that {r\i)i^z is centered and satisfies conditions (A2) and (A3) of Theo- 
rem \3.1\ then for any reals —k n <a<b<k n , 



Var I a n,iVi ) < C °»*>< 

\i=a / i=a 



mtfi C = sup fcZ (E77?) + 2 A /su PieZ (Er?2) E£i MO- 

Before proving Lemma 17.11 we finish the proof of (|7.15|) . 

For any fixed n > 0, and any —k n < i < k n such that a n ^ 7^ 0, define : 

V n ,i = <f enan/ ^(^) - E Lp^^lM^)^ . 
If a n ^i = 0, let V ny i = 0. As for any fixed n > 0, —k n < i < k n , the function 

is 1-Lipschitz, we have for all / > 1, for all k > 1, 

< 2 "(0<< 2 (0, 

where K, n = (K,i)-fc n <j<fc n and £ = (£i)iez- Hence, for any fixed n, the sequence 
(V nj i)-k„<i<k n satisfies assumptions (^2) and (A3) of Theorem 13.11 Moreover, as 

a n ^ £nannanA (ti) = V £nan (a n>i £i), 

applying Lemma 17.11 yields 



( r- 2 Var( V £nan (an4i)) < C n a~ 2 ^ < i5 (7.16) 



with C n = sup_ An < i < jfcn (El/ 2 .j + 2y S up_ fcn < i < (fcn (eV^J E/=i^i,2(0- Jt remains to prove 
that the right hand term in (|7.16p converges to as n goes to infinity. 
We have, for n large enough, 

E K 2 ! < E \a nJ \) • ( 7 - 17 ) 

Hence we conclude with ()7.14p . 
Proof of Lemma I7.lt 

b b b b 

Var ( ^2 a n>j r]jj = ^ a\j Var(?? i ) + ^ ^ a n ,; a n j Covfa, 77;) 
J=a i= a i=a j=a;j^i 

b b b 

< J2 a l,j Var(? ?i ) + ^a 2 i ^ | Cov(r?j, 7?j)| 

j=a i=a j=a;j^i 
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by remarking that |a n> i| \a n j\ < |(o^j + a? n j ). 

Then for any j > i, using Cauchy-Schwarz inequality, we obtain that 

|Cov( M )| = lE^E^-IM))! 

< \\ Vi \\ 2 ||E (Vj\Mi)\\ 2 

< Wmlh 6*1,20' - i)- 

As (r]i)i£Z is centered, and as (rjf)i € z is uniformly integrable, we deduce that 

b b 

Var ( 

j=a j=a 



with C = supj gZ (E?]?) + 2ysup igZ (E77?) YlbLi 01,2(0 which is finite from assumptions (A 2 ) 
and (A 3 ). A 

Define Z n ^ by 

y e „ g „(^n,i) - E(<p e „ gn pf W[i )) _ y> Ew<r „ (X W|i ) - E (<p eti<Tn (X w ,,)) 

By (17.15P we conclude that, to prove Theorem 13 .1\ it is enough to prove it for the truncated 
sequence (Z nt i) n>Q _ kn<i<kn , that is to show that 



^2 Z n>i ^-^M(0,1). (7.18) 

i— k)i 



The proof is now a variation on the proof of Theorem 4.1 in Utev (1990). Let 

dt(X,Y) = \Ee itx -Ke itY \ . 
To prove Theorem 13. 11 it is enough to prove that for all t, 




with 7] the standard normal distribution. We first need some simple properties of the distance 
dt- Let X,Xx,X 2 ,Yi,Y 2 be random variables with zero means and finite second moments. 
We assume that the random variables Y\ , Y 2 are independent and that the distribution of Xj 



coincides with that of Yj, j = 1,2. We define A t {X) = d t [X, r]VEX 2 j. We have then the 
following inequalities : 

Lemma 7.2 (Lemma 4.3 in Utev, 1990) 

A t (X) < ^|t| 3 E|X| 3 , 
A t (Y l +Y 2 )<A t (Y l )+A t (Y 2 ), 
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d t {x 1 + x 2 ,x 1 )< t ^ (ex| + (EA^EXf) 1 / 2 ) , 

t 2 

d t (r)a,r)b) < — \a 2 - b 2 \. 
We next need the following lemma : 

Lemma 7.3 Let < e < 1. There exists some positive constant C(e) such that for all a E Z, 
/or a// d G N*, it (Xa=o+l ^W) * s bounded by 

(a+v I \ a+u \ 

t=o+l \ j :2i>h 1 / e / i=a+l J 

where h is an arbitrary positive natural number and with g introduced in Assumption (A3) of 
Theorem \3.1[ 



Before proving Lemma 17.31 we achieve the proof of Theorem 13. 11 By Lemma 17.31 we have 

/ \ / fan \ / 

ck Yl Z ^A =A t [ z n* < c ^ £ ) h2/e E n\Zn£)+m 



With 8(h) = [h^ + £. 0> h l ^9^ £ )V'n 2 T,t- k yn,- 



From (|7.15p we deduce that o~ n /o~' n ► 1. 

Hence using assumptions (A±)(i) and (A3) we get 5(h) > 0. 

h— ++00 



k n k n 



On the other hand we have 

V E(\Z n ^\ 3 ) < 2 ^ V Var(Z njl ). (7.19) 
For any fixed n > 0, and any — k n < i < k n such that a n ^ 7^ 0, define : 

W nt i = <Ps n a n /\a n ,i\(Ci) ~ E (^„cr„/|a nii | (&)) • 

If a ni « = 0, let Wn,* = 0. As for any fixed n > 0, — £;„ < i < k n , the function 

X ^ ^na n /|a n ,i|(^ - E (^an/K.ilfe)) 

is 1-Lipschitz, we have for all / > 1, for all /c > 1, 

where W. )Tl = (W n ,i)-k n <i<k n and £ = (&)iez- Hence, arguing as for the proof of Lemma EH 
we prove that for any fixed n, the sequence (Wn,i)-fcn<i<ifc„ satisfies assumptions (A2) and 
(A3) of Theorem 13.11 Therefore, for any reals —k n < a <b < k n , 

Var (j2 Z n \ < Co-'" 2 £ a 2 n>i (7.20) 
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with C = supj gZ (2E£ 2 ) +2y sup igZ (2E£ 2 ) X^^i ^1,2(0 which is finite from assumptions (A2) 
and (-A3). Applying (|7.20p with a = b, we get that the right hand term of (|7.19p is bounded 

k Ca? 

by 2 £nfr Si=-fc — rsn-> which tends to zero as n tends to infinity, using assumption {A\){i) 
and the fact that e n ► 0. 

n— >+oo 

Consequently 



inf /i 2 / £ E(|Z n ,i| 3 ) + 5(/i) 



h>- . 

It achieves the proof of Theorem 13. 11 A 



Proof of Lemma I7.3t Let h G N*. Let < e < 1. In the following, C, C(e) denote constants 
which may vary from line to line. Let K e be a positive constant greater than 1 which will be 
precised further. Let v < n e he. We have 



a+v 



\i=a+l 



a+v 



i=a+l 



a+v 



< - 4 \t\ 3 h 2 ' £ e (I^mI 3 ) (7.21) 



i=a+l 



since \x\ is a convex function. 



Let now v > k £ . Without loss of generality, assume that a = 0. Let 5 e = (l-e 2 + 2e)/2. 
Define then 

m = [ v £ ], B = I it G N : 2 -1 (u - [«*«]) <um< 2 _1 u} , 



A= ^ 



(w+l)m u 
G N : < u < v, a li ^ ( m / v ¥ E a nr 

i=um+l i=l 



Following Utev (1991) we prove that, for < £ < 1, AnBis not wide for v greater than k £ 
We have indeed 



\AnB\ = \B\ — \Ar\B\ > \B\ - \A\ > 



3 
2 



where A denotes the complementary of the set A. We can find k £ large enough so that |Ani?| 
be positive. 

Let u £ An B. We start from the following simple identity 



Q = y ] ^n,i 



um 



(u+l)m v 

= Y^ Z n,% Yl ^ n ' i ^ ^n,i 

i=l i=um+l i=(u+l)m+l 

= Q1 + Q2 + Q3. 



(7.22) 



By Lemma l7.2i 



d t (Q, Qi + Q 3 ) = d t (Q, Q - Q 2 ) < - (eQ 2 + (EQ^EQ 2 ) 1 / 2 



(7.23) 
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Using (TT231) and (TT20J) . we get 

V 

dt(Q, Qi + Q 3 ) < Ct 2 v^a'~ 2 <r (7-24) 

i=l 

Now, given the random variables Qi and Q3, we define two independent random variables g\ 
and 53 such that the distribution of gi coincides with that of Qi, i = 1,3. We have 

d t {Qi + Q 3 , 91+93) = \^(e ltQl -l)(e itQ3 -1) -E{e itQl -l)E(e itQ3 

< \\E(e itQl -l)|| 2 ||E(e^ 3 - 1 - E(e'* Q3 -1) \M um )\\ 2 

< 2\t\ Z n ,i\\ 2 V 1*1 °'n l (Ei=( U +l)m+l K,i|) ^(m + 1) 

by (|7.20p . Definition 12.21 and Assumption (^3) of Theorem 13.11 Hence 

V 

dt(Qi + Qs,9i + 93) < Ct 2 f(v)a'- 2 Y, <i, (7-25) 

i=i 

where f(v) = v 3 ^ 2 g(v e ) is non-increasing by assumption (^3) of Theorem 13.11 
We also have by Lemma 17.21 

M9i + 93) < M9i) + Mas)- (7.26) 
Finally, still by Lemma 17.21 and using Definition 12.21 we have 

t 2 



d t {r,^nm,^{{9i+93) 2 )) < -\E(Q 2 )-E((g 1+ g 3 ) 



t 2 



< 2" l E ^2) + 2E(Q!Q 2 ) + 2E(Q 2 Q 3 ) + 2E(Q 1 Q 3 )\ 

< Ct 2 (v^ + f(v))a'- 2 ir<>- (7-27) 



i=i 



Combining (|7.24p - (|7.27p . we get the following recurrent inequality : 

A t (Yh=1 Z n,i) < A t {Yh=1 Z n,i) + A t [Yn=(u+l)m+l Z n,i 



+ Ct 2 (v^ + f(v))a'- 2 EU 



a 2 ■ 



1 



for v > k £ he > k £ . 

We then need the following Lemma, which is a variation on Lemma 1.2. in Utev (1991). 

Lemma 7.4 For every e G ]0, 1[, denote 5 £ = (1 — e 2 + 2e)/2. Let a non- decreasing sequence 
of non-negative numbers a(n) be specified, such that there exist non-increasing sequences of 
non-negative numbers e(k), j(k) and a sequence of naturals T(k), satisfying conditions 

T{k)<2- 1 (k + [k Se }), 
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a{k) < max (o(T(s)) + j(s)) 

ko<s<k 

for any k > ko with an arbitrary kg £ N*. Then 

a(n)<a(n ) + 2 j(2 j ) , 

k <2i<n 

for any n > ko, where one can take hq = 2 C with c > j^f- . 



Proof of Lemma 17. 4t The proof follows essentially the same lines as the proof of Lemma 
1.2. in Utev (1991) and therefore is omitted here. A 



We now apply Lemma 17.41 above with 

•k k$ = K £ He , 

•k for k > ko, T(k) = m&x{ukmk, k — Uktrtk — rrik} where Uk and mk are defined from k as 
u and m from v (see the proof of A D B not wide) , 

★ c < l fa(2) ( we ma y nee d to enlarge k £ ), 

★ for s > k , 7 (s) = C t 2 (s 1 ^ + /(a)) , 

★ for a > fco, a(s) = sup max 



Applying Lemma 17.41 yields the statement of Lemma 17.31 A 
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